A throughput maximised parallel architecture for 2D fast Discrete Pascal Transform

Authors

  • Ming Ming Wong
  • Mou Ling Dennis Wong
  • Ismat Hijazin
Abstract

In this paper, we present a fully pipelined parallel implementation of a two-dimensional (2D) Discrete Pascal Transform (DPT). Our approach first uses the properties of the Kronecker product and the vec operation on matrices to form an alternate 2D DPT representation suitable for column-parallel computation. Next, we draw on the results of Skodras' work on the 1D DPT to arrive at the final architecture for the fast 2D DPT. With a fully pipelined implementation, the architecture has an initial latency of 2(N − 1) clock cycles and a maximum throughput of one complete two-dimensional transform every clock cycle, for any input matrix of size N×N. To evaluate our work, results obtained from an actual FPGA implementation were benchmarked against results from previous works.
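The Kronecker/vec reformulation mentioned in the abstract can be illustrated with a small software sketch. It assumes the commonly used definition of the N×N DPT matrix P, with entries (-1)^j C(i, j) for j ≤ i and zero elsewhere (the abstract itself does not state the definition), and checks the standard identity vec(P X Pᵀ) = (P ⊗ P) vec(X), where vec stacks the columns of a matrix. All function names are hypothetical, and the code models only the arithmetic, not the paper's pipelined hardware architecture.

```python
from math import comb


def pascal_matrix(n):
    """Assumed DPT matrix: P[i][j] = (-1)**j * C(i, j), lower triangular."""
    return [[(-1) ** j * comb(i, j) if j <= i else 0 for j in range(n)]
            for i in range(n)]


def matmul(a, b):
    """Plain matrix product of two lists-of-lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def transpose(a):
    return [list(row) for row in zip(*a)]


def kron(a, b):
    """Kronecker product: block (i, j) of the result is a[i][j] * b."""
    p, q = len(b), len(b[0])
    return [[a[i // p][j // q] * b[i % p][j % q]
             for j in range(len(a[0]) * q)]
            for i in range(len(a) * p)]


def vec(x):
    """Stack the columns of x into one flat list (column-major order)."""
    return [x[i][j] for j in range(len(x[0])) for i in range(len(x))]


def dpt2d(x):
    """Direct 2D transform Y = P X P^T."""
    p = pascal_matrix(len(x))
    return matmul(matmul(p, x), transpose(p))


def dpt2d_vec(x):
    """Same transform as one matrix-vector product: (P kron P) vec(X)."""
    p = pascal_matrix(len(x))
    big = kron(p, p)
    v = vec(x)
    return [sum(row[k] * v[k] for k in range(len(v))) for row in big]


if __name__ == "__main__":
    x = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
    print(vec(dpt2d(x)) == dpt2d_vec(x))  # both formulations should agree
```

Under the assumed definition, P is its own inverse, so applying `dpt2d` twice returns the original matrix; this gives a quick sanity check on any implementation.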

Related resources

A rescheduling and fast pipeline VLSI architecture for lifting-based discrete wavelet transform

A lifting-based 2D DWT with an efficient folded architecture and parallel scanning is proposed. The architecture achieves lower hardware complexity and memory requirements by multiplexing two stages of the lifting architecture. The 2D DWT architecture is realized by cascading two 2D processing elements. The coefficients for the lifting stage were chosen according to the 9/7 filter. The 1D p...

Unified Architecture for Discrete Fourier and Inverse Cosine Transforms

A unified architecture for radix-2 decimation-in-time fast Fourier transform and inverse discrete cosine transform of type II is presented based on constant geometry formulation of the algorithms. The parallelism of the architecture can be specified to meet various throughput and area requirements. The type and size of the transform in the architecture can be configured in run-time. In addition...

A high-throughput, memory efficient architecture for computing the tile-based 2D discrete wavelet transform for the JPEG2000

In this paper, the design and implementation of an optimized hardware architecture in terms of speed and memory requirements for computing the tile-based 2D forward discrete wavelet transform for the JPEG2000 image compression standard, are described. The proposed architecture is based on a well-known architecture template for calculating the 2D forward discrete wavelet transform. This architec...

High Throughput Reconfigurable Resource Sharing Parallel Architecture for a 2D Lifting Based DWT

Article history: Received 16 April 2015 Accepted 12 June 2015 Available online 1 July 2015

Distributed Memory Parallel Architecture Based on Modular Linear Arrays for 2-D Separable Transforms Computation

A framework for systematically mapping 2-dimensional (2-D) separable transforms onto a parallel architecture consisting of fully pipelined linear array stages is presented. The resulting model architecture is characterized by its generality, high degree of modularity, high throughput, and the exclusive use of distributed memory and control. There is no central shared memory block to facilitate ...


Journal:
  • Computers & Electrical Engineering

Volume 36  Issue 

Pages  -

Publication date 2010